A high performance implementation of MPI-IO for a Lustre file system environment

نویسندگان

Phillip M. Dickens

Jeremy S. Logan

چکیده

It is often the case that MPI-IO performs poorly in a Lustre file system environment, although the reasons for such performance have heretofore not been well understood. We hypothesize that such performance is a direct result of the fundamental assumptions upon which most parallel I/O optimizations are based. In particular, it is almost universally believed that parallel I/O performance is optimized when aggregator processes perform large, contiguous I/O operations in parallel. Our research, however, shows that this approach can actually provide the worst performance in a Lustre environment, and that the best performance may be obtained by performing a large number of small, non-contiguous I/O operations. In this paper, we provide empirical results demonstrating these non-intuitive results and explore the reasons for such unexpected performance. We present our solution to the problem, which is embodied in a user-level library termed Y-Lib, which redistributes the data in a way that conforms much more closely with the Lustre storage architecture than does the data redistribution pattern employed by MPI-IO. We provide a large body of experimental results, taken across two large-scale Lustre installations, demonstrating that Y-Lib outperforms MPI-IO by up to 36% on one system and 1000% on the other. We discuss the factors that impact the performance improvement obtained by Y-Lib, which include the number of aggregator processes and Object Storage Devices, as well as the power of the system’s communications infrastructure. We also show that the optimal data redistribution pattern for Y-Lib is dependent upon these same factors. Copyright © 2009 John Wiley & Sons, Ltd.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Towards a High Performance Implementation of MPI-IO on the Lustre File System

Abs tra ct—Lustre is becoming an increasingly important file system for large-scale computing clusters. The problem is that many dataintensive applications use MPI-IO for their I/O requirements, and it has been well documented that MPI-IO performs poorly in a Lustre file system environment. However, the reasons for such poor performance are not currently well understood. We believe that the pri...

متن کامل

A scalable MPI implementation of an ad-hoc parallel I/O system

This paper presents the novel design, implementation and evaluation of an ad-hoc parallel I/O system (AHPIOS). AHPIOS is the first scalable parallel I/O system completely implemented in Message Passing Interface (MPI). The MPI implementation brings the advantages of portability, scalability and high-performance. AHPIOS allows MPI applications to dynamically manage and scale distributed partitio...

متن کامل

A Scalable Message Passing Interface Implementation of an Ad-Hoc Parallel I/o system

In this paper we present the novel design, implementation, and evaluation of an ad-hoc parallel I/O system (AHPIOS). AHPIOS is the first scalable parallel I/O system completely implemented in the Message Passing Interface (MPI). The MPI implementation brings the advantages of portability, scalability and high performance. AHPIOS allows MPI applications to dynamically manage and scale distribute...

متن کامل

LIOProf: Exposing Lustre File System Behavior for I/O Middleware

As parallel I/O subsystem in large-scale supercomputers is becoming complex due to multiple levels of software libraries, hardware layers, and various I/O patterns, detecting performance bottlenecks is a critical requirement. While there exist a few tools to characterize application I/O, robust analysis of file system behavior and associating file-system feedback with application I/O patterns a...

متن کامل

A First Implementation of Parallel IO in Chapel for Block Data Distribution

This paper presents our preliminary implementations of parallel IO routines in Chapel, a high-productivity parallel language for large-scale systems. The IO functions are implemented using standard Chapel features, taking POSIX as the IO middleware layer and Lustre as the target parallel file system. In this work, we focus on the Chapel Block data distribution, for which we propose different al...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

Concurrency and Computation: Practice and Experience

دوره 22 شماره

صفحات -

تاریخ انتشار 2010

A high performance implementation of MPI-IO for a Lustre file system environment

نویسندگان

چکیده

منابع مشابه

Towards a High Performance Implementation of MPI-IO on the Lustre File System

A scalable MPI implementation of an ad-hoc parallel I/O system

A Scalable Message Passing Interface Implementation of an Ad-Hoc Parallel I/o system

LIOProf: Exposing Lustre File System Behavior for I/O Middleware

A First Implementation of Parallel IO in Chapel for Block Data Distribution

عنوان ژورنال:

اشتراک گذاری